Unicode character property について

Words near each other

・ Unidade Arranho
・ Unidade Bombarral
・ Unidade Castelhanos Formation
・ Unidade Habitational de Sao Antonio
・ Unidade por Narón
・ Unidade real de valor
・ Unidale Mall
・ Unidan
・ Unidare RFC
・ Uniden
・ Uniden LPGA Invitational
・ Unidentified
・ Unicode and HTML for the Hebrew alphabet
・ Unicode anomaly
・ Unicode block
・ Unicode character property
・ Unicode collation algorithm
・ Unicode compatibility characters
・ Unicode Consortium
・ Unicode control characters
・ Unicode equivalence
・ Unicode font
・ Unicode in Microsoft Windows
・ Unicode input
・ Unicode subscripts and superscripts
・ Unicode symbols
・ Unicode Technical Standard
・ Unicoherent space
・ Unicoi
・ Unicoi County, Tennessee

Dictionary Lists

mini英和辞書

翻訳と辞書　辞書検索 [ 開発暫定版 ]

スポンサードリンク

Unicode character property ：ウィキペディア英語版

Unicode character property
Unicode assigns character properties to each code point.〔(Unicode 6.0 chapter 4 )〕 These properties can be used to handle "characters" (code points) in processes, like in line-breaking, script direction right-to-left or applying controls. Slightly inconsequently, some "character properties" are also defined for code points that have no character assigned, and code points that are labeled like "". The character properties are described in Standard Annex #44.
Properties have levels of forcefulness: normative, informative, contributory, or provisional. For simplicity of specification, a character property can be assigned by specifying a continuous range of code points that have the same property.
==Name==
A Unicode character is assigned a unique Name (na).〔 The name, in English, is composed of uppercase letters A–Z, digits 0–9, - (hyphen-minus) and . Some sequences are excluded: names beginning with a space or hyphen, names ending with a space or hyphen, repeated spaces or hyphens, and space after hyphen are not allowed. The name is guaranteed to be unique within Unicode, and can be used to identify a code point and its character. Ideographic characters, of which there are tens of thousands, are named in the pattern "-''hhhh''". For example, . Formatting characters are named too: .
Starting from Unicode version 2.0, the published name for a code point will never change. In the event of a misspelling in a publication, a correct name will later be assigned to the code point as a Character Name Alias. Within the whole range of names, an alias is unique too.
Apart from these normative names, informal names can be assigned. These are usually other commonly used names for a character, used for illustration, but these informal names are not guaranteed to be unique.
These code points do not have a Name (na=""): Controls (General Category: Cc), Private use (Co), Surrogate (Cs), Non-characters (Cn) and Reserved (Cn). They may be referenced, informally, by a generic or specific meta-name, called "Code Point Labels": , , , , , . Since these labels contain <>-brackets, they can never appear as a Name, which prevents confusion.

抄文引用元・出典: フリー百科事典『ウィキペディア（Wikipedia）』
■ウィキペディアで「Unicode character property」の詳細全文を読む

スポンサードリンク

翻訳と辞書 : 翻訳のためのインターネットリソース